Internal validation of predictive logistic regression models for decision-making in wildlife management
نویسندگان
چکیده
Predictive logistic regression models are commonly used to make informed decisions related to wildlife management and conservation, such as predicting favourable wildlife habitat for land conservation objectives and predicting vital rates for use in population models. Frequently, models are developed for use in the same population from which sample data were obtained, and thus, they are intended for internal use within the same population. Before predictions from logistic regression models are used to make management decisions, predictive ability should be validated. We describe a process for conducting an internal model validation, and we illustrate the process of internal validation using logistic regression models for predicting the number of successfully breeding wolf packs in six areas in the US northern Rocky Mountains. We start by defining the major components of accuracy for binary predictions as calibration and discrimination, and we describe methods for quantifying the calibration and discrimination abilities of a logistic regression model. We also describe methods for correcting problems of calibration and future predictive accuracy in a logistic regression model. We then show how bootstrap simulations can be used to obtain unbiased estimates of prediction accuracy when models are calibrated and evaluated within the same population from which they were developed. We also show how bootstrapping can be used to assess coverage rates and recalibrate the endpoints of confidence intervals for predictions from a logistic regression model, to achieve nominal coverage rates. Using the data on successfully breeding wolf packs in the northern Rocky Mountains, we validate that predictions from a model developed with data specific to each of six analysis areas are better calibrated to each population than a global model developed using all data simultaneously. We then use shrinkage of model coefficients to improve calibration and future predictive accuracy for the area-specific model, and recalibrate confidence interval endpoints to provide better coverage properties. Following this validation, managers can be confident that logistic regression predictions will be reliable in this situation, and thus that management decisions will be based on accurate predictions.
منابع مشابه
Matrix Sequential Hybrid Credit Scorecard Based on Logistic Regression and Clustering
The Basel II Accord pointed out benefits of credit risk management through internal models to estimate Probability of Default (PD). Banks use default predictions to estimate the loan applicants’ PD. However, in practice, PD is not useful and banks applied credit scorecards for their decision making process. Also the competitive pressures in lending industry forced banks to use profit scorecards...
متن کاملارائه مدلی جهت پیش بینی بیماری دیابت با استفاده از شبکه عصبی
Introduction: Meta-heuristic and combined algorithms have a great capability in modelling medical decision making. This study used neural networks in order to predict Type 2 Diabetes (T2D) among high risk individuals. Methods: This study was an applied research. Data from 545 individuals (diabetic and non-diabetic), in Diabetes Clinic of Hamedan University of Medical Sciences, we...
متن کاملCEO Emotional Intelligence and Board of Directors Efficiency
This article deals with the relationship existing between the emotional aspect and decision-making processes. More specifically, it examines the links between emotional intelligence, decision biases and effectiveness of the governance mechanisms. The primary purposes of this article are to: consider emotional intelligence like new research ideas that make important contributions to society; off...
متن کاملThe Comparison of Credit Risk between Artificial Neural Network and Logistic Regression Models in Tose-Taavon Bank in Guilan
One of the most important issues always facing banks and financial institutes is the issue of credit risk or the possibility of failure in the fulfillment of obligations by applicants who are receiving credit facilities. The considerable number of banks’ delayed loan payments all around the world shows the importance of this issue and the necessary consideration of this topic. Accordingly...
متن کاملRanking stocks of listed companies on Tehran stock exchange using a hybrid model of decision tree and logistic regression
Much research has introduced linear or nonlinear models using statistical models and machine learning tools in artificial intelligence to estimate Iran's rate of return. The primary purpose of these methods is simultaneously use different independent variables to improve stock return rates' modeling. However, in predicting the rate of return, in addition to the modeling method, the degree of co...
متن کامل